ciently on Average, in Parallel, and by Asking Queries
Authors
Abstract
A pattern is a string of constant and variable symbols. The language generated by a pattern is the set of all strings of constant symbols that can be obtained from the pattern by substituting non-empty strings for variables. We study the learnability of one-variable pattern languages in the limit with respect to the update time needed for computing a new single guess and the expected total learning time taken until convergence to a correct hypothesis. The results obtained are threefold. First, we design a consistent and set-driven learner that, using the concept of descriptive patterns, achieves update time O(n^2 log n), where n is the size of the input sample. The best previously known algorithm to compute descriptive one-variable patterns requires time O(n^4 log n) (cf. Angluin [1]). Second, we give a parallel version of this algorithm requiring time O(log n) and O(n^3 / log n) processors on an EREW-PRAM. Third, we devise a one-variable pattern learner whose expected total learning time is O(ℓ^2 log ℓ), provided the sample strings are drawn from the target language according to a probability distribution D with expected string length ℓ. The distribution D must be such that strings of equal length have equal probability, but can be arbitrary otherwise. Thus, we establish the first one-variable pattern learner having an expected total learning time that provably differs from the update time by a constant factor only. Finally, we apply the algorithm for finding descriptive one-variable patterns to learn one-variable patterns with a polynomial number of superset queries with respect to the one-variable patterns as query language.
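To make the definition of a one-variable pattern language concrete, the following is a minimal sketch of a membership test. It is not the learner described in the abstract. The function name `generates`, the encoding of a pattern as a Python string, and the use of the character "x" as the variable symbol are assumptions made for illustration only; constants are any other characters.

```python
def generates(pattern: str, word: str, var: str = "x") -> bool:
    """Return True if `word` is in the language of the one-variable `pattern`.

    Assumption for this sketch: `var` marks the single variable and never
    occurs as a constant symbol.
    """
    k = pattern.count(var)            # number of variable occurrences
    num_constants = len(pattern) - k  # number of constant symbols
    if k == 0:
        # Without a variable, the pattern generates only itself.
        return pattern == word
    extra = len(word) - num_constants
    # Every occurrence receives the same non-empty string, so the leftover
    # length must be a positive multiple of k.
    if extra < k or extra % k != 0:
        return False
    m = extra // k                    # forced length of the substituted string
    start = pattern.index(var)        # constants before the first variable
    substitution = word[start:start + m]
    candidate = "".join(substitution if s == var else s for s in pattern)
    return candidate == word
```

For example, under these assumptions `generates("0x1x", "001101")` holds with the substitution "01", while `generates("0x1x", "01")` does not, because the variable may not be replaced by the empty string. With a single variable, the length of the word fixes the length of the substitution, so only one candidate substitution has to be checked.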
Similar resources
A fuzzy reliability model for series-parallel systems
Fuzzy set based methods have been proved to be effective in handling many types of uncertainties in different fields, including reliability engineering. This paper presents a new approach on fuzzy reliability, based on the use of beta type distribution as membership function. Considering experts' ideas and by asking operators linguistic variables, a rule base is designed to determine the level ...
Partitioning Algorithms for the Computation of Average Iceberg Queries
Iceberg queries compute aggregate functions over an attribute (or set of attributes) to find aggregate values above some specified threshold. It is difficult to execute these queries because the number of unique data is greater than the number of counter buckets in memory. However, previous research has the limitation that average functions were out of consideration among aggregate functions. ...
Identification of Partial Disjunction, Parity, and Threshold Functions
Let F be a class of functions obtained by replacing some inputs of a Boolean function of a fixed type with some constants. The problem considered in this paper, which is called attribute-efficient learning, is to identify "efficiently" a Boolean function g out of F by asking for the value of g at chosen inputs, where "efficiency" is measured in terms of the number of essential variables. We study the q...
Parallel Generation of t-ary Trees
A parallel algorithm for generating t-ary tree sequences in reverse B-order is presented. The algorithm generates t-ary trees by 0-1 sequences, and each 0-1 sequence is generated in constant average time O(1). The algorithm is executed on a CREW SM SIMD model, and is adaptive and cost-optimal. Prior to the discussion of the parallel algorithm a new sequential generation with O(1) average time ...
Active Learning of Classes of Recursive Functions by Ultrametric Algorithms
We study active learning of classes of recursive functions by asking value queries about the target function f, where f is from the target class. That is, the query is a natural number x, and the answer to the query is f(x). The complexity measure in this paper is the worst-case number of queries asked. We prove that for some classes of recursive functions ultrametric active learning algorithm...